Evaluation of model adaptation by HMM decomposition on telephone speech recognition
نویسندگان
چکیده
In this paper, we evaluate performance of model adaptation by the previously proposed HMM decomposition method[1] on telephone speech recognition. The HMM decomposition method separates a composed HMM into a known phoneme HMM and an unknown noise and channel HMM by maximum likelihood (ML) estimation of the HMM parameters. A transfer function (telephone channel) HMM is estimated using adaptation speech data by applying the HMM decomposition twice in the linear spectral domain for noise and in the cepstral domain for channel. The telephone speech data for evaluation are recorded through 10 kinds of ordinary analog telephone handsets and cordless telephone handsets. The test results show that the average phrase accuracy with the clean speech HMMs is 60.9% for the ordinary analog telephone handsets, and 19.6% for the cordless telephone handsets. By the HMM decomposition method, the average phrase accuracy is improved to 78.1% for the ordinary analog telephone handsets, and 50.5% for the cordless telephone handsets.
منابع مشابه
Bayesian affine transformation of HMM parameters for instantaneous and supervised adaptation in telephone speech recognition
This paper proposes a Bayesian affine transformation of hidden Markov model (HMM) parameters for reducing the acoustic mismatch problem in telephone speech recognition. Our purpose is to transform the existing HMM parameters into its new version of specific telephone environment using affine function so as to improve the recognition rate. The maximum a posteriori (MAP) estimation which merges t...
متن کاملHybrid model decomposition of speech and noise in a radial basis function neural model framework
This paper focus on a new approach to automatic speech recognition in noisy environments where the noise has either stationary or non-stationary statistical characteristics. The aim is to perform automatic recognition of speech in the precence of additive car noise. The technique applied is based on a combination of the Hidden Markov Model (HMM) decomposition method [ 11, for speech recognition...
متن کاملModel Adaptation Based on Hmm Decomposition for Reverberant Speech Recognition
The performance of a speech recognizer is degraded drastically in reverberant environments. We proposed a novel algorithm which can model an observation signal by composition of HMMs of clean speech, noise and an acoustic transfer function(l]. However, how to estimate HMM parameters of the acoustic transfer function is a remaining serious problem. In our previous paperll], we measured real impu...
متن کاملModel adaptation based on HMM decomposition for reverberant speech recognition
The performance of a speech recognizer is degraded drastically in reverberant environments. We proposed a novel algorithm which can model an observation signal by composition of HMMs of clean speech, noise and an acoustic transfer function[1]. However, how to estimate HMM parameters of the acoustic transfer function is a remaining serious problem. In our previous paper[1], we measured real impu...
متن کاملA New Fast and Efficient HMM-Based Face Recognition System Using a 7-State HMM Along With SVD Coefficients
In this paper, a new Hidden Markov Model (HMM)-based face recognition system is proposed. As a novel point despite of five-state HMM used in pervious researches, we used 7-state HMM to cover more details. Indeed we add two new face regions, eyebrows and chin, to the model. As another novel point, we used a small number of quantized Singular Values Decomposition (SVD) coefficients as feature...
متن کامل